Hokkaido University Research Profiles

Japanese

Online optimization: 1

Numbers of lines 20 50 No Page Break Theme Icons
  • Life Sciences
  • Information and Communication
  • Nanotechnology / Materials
  • Manufacturing Technology
  • Human and Social Sciences
  • Energy
  • Environment
  • Tourism / Community development
  • Arctic Research
  • Social Infrastructure
  • Open Facilities
  • Recommendation Techniques Using the Bandit Method

    Online learning technology that maximizes cumulative gain while acquiring knowledge

    We are researching a recommendation method that maximizes the user's cumulative satisfaction, not only by recommending items that the user may prefer (use of knowledge), but also items that may provide more information about the user's preferences (acquisition of knowledge) in a balanced manner.

    Research

    In today's internet society, recommendation technology, if it works well, can benefit both the provider and the receiver of the service. A recommendation service is not a one-time event, but an iterative process with feedback each time, and the feedback only concerns the items that are recommended. Therefore, to increase the accuracy of subsequent recommendations, it is not only important to recommend items that the user is likely to like based on the feedback history (knowledge utilization), but also items from which the user is likely to acquire more information (knowledge acquisition). The Bandit method attempts to maximize user satisfaction by balancing the use and acquisition of knowledge. We are developing a recommendation system using this method.